NeuTTS Air is the world's first ultra-realistic, edge-side text-to-speech (TTS) language model with instant voice cloning capabilities. Built on a large language model backbone with 0.5B parameters, it brings natural voice, real-time performance, built-in security, and speaker cloning capabilities to local devices.
Audio Processing
Gguf